On consistency and robustness properties of Support Vector Machines for heavy-tailed distributions

Authors

  • Andreas Christmann
  • Arnout Van Messem
  • Ingo Steinwart
Abstract

Support Vector Machines (SVMs) are known to be consistent and robust for classification and regression if they are based on a Lipschitz continuous loss function and on a bounded kernel with a dense and separable reproducing kernel Hilbert space. These facts are even true in the regression context for unbounded output spaces, if the target function f is integrable with respect to the marginal distribution of the input variable X and if the output variable Y has a finite first absolute moment. The latter assumption clearly excludes distributions with heavy tails, e.g., several stable distributions or some extreme value distributions which occur in financial or insurance projects. The main point of this paper is that we can enlarge the applicability of SVMs even to heavy-tailed distributions, which violate this moment condition. Results on existence, uniqueness, representation, consistency, and statistical robustness are given.
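The role of the moment condition can be illustrated numerically. The abstract does not say which device the paper uses to drop the condition; the sketch below assumes one device known from robust statistics, a shifted loss L*(y, t) = L(y, t) - L(y, 0), and uses the absolute-distance loss with standard Cauchy outputs as an illustrative choice. All function names are hypothetical. For a loss that is Lipschitz in t with constant 1, the shifted loss is bounded by |t| for every y, so its expectation exists even when Y has no finite first absolute moment.

```python
import math
import random


def cauchy_sample(n, seed=0):
    """Draw n standard Cauchy variates via the inverse CDF.

    The Cauchy distribution is heavy-tailed: E|Y| is infinite.
    """
    rng = random.Random(seed)
    return [math.tan(math.pi * (rng.random() - 0.5)) for _ in range(n)]


def abs_loss(y, t):
    """Absolute-distance loss, Lipschitz in t with constant 1."""
    return abs(y - t)


def shifted_abs_loss(y, t):
    """Shifted loss L*(y, t) = L(y, t) - L(y, 0) (an assumption of this sketch).

    By the reverse triangle inequality, |L*(y, t)| <= |t| for every y.
    """
    return abs_loss(y, t) - abs_loss(y, 0.0)


def empirical_risk(loss, ys, t):
    """Average loss of the constant prediction t over the sample ys."""
    return sum(loss(y, t) for y in ys) / len(ys)


ys = cauchy_sample(100_000)
t = 2.0  # hypothetical constant prediction
plain = empirical_risk(abs_loss, ys, t)
shifted = empirical_risk(shifted_abs_loss, ys, t)
print(f"plain empirical risk:   {plain:.3f}")
print(f"shifted empirical risk: {shifted:.3f}")
```

The plain empirical risk is dominated by a few extreme observations and does not settle as the sample grows, whereas the shifted version always lies in [-|t|, |t|]. Subtracting L(y, 0) does not depend on t, so it leaves the minimizer unchanged whenever the unshifted risk is finite.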

Similar resources

A review on consistency and robustness properties of support vector machines for heavy-tailed distributions

Support vector machines (SVMs) belong to the class of modern statistical machine learning techniques and can be described as M-estimators with a Hilbert norm regularization term for functions. SVMs are consistent and robust for classification and regression purposes if based on a Lipschitz continuous loss and a bounded continuous kernel with a dense reproducing kernel Hilbert space. For regress...

Qualitative Robustness of Support Vector Machines

Support vector machines have attracted much attention in theoretical and in applied statistics. Main topics of recent interest are consistency, learning rates and robustness. In this article, it is shown that support vector machines are qualitatively robust. Since support vector machines can be represented by a functional on the set of all probability measures, qualitative robustness is proven ...

Nonparametric signed-rank Shewhart control chart with variable sampling interval

A nonparametric control chart based on ranks is used for detecting changes in the median (mean). In this article, the signed-rank control chart is considered with a variable sampling interval. We compared the performance of the signed-rank chart with variable sampling interval (VSI-SR) to that of the signed-rank chart with fixed sampling interval (FSI-SR); the numerical results demonstrated that the VSI feature is useful. Bakir [1] showed ...

Predicting tensile strength of rocks from physical properties based on support vector regression optimized by cultural algorithm

The tensile strength (TS) of rocks is an important parameter in the design of a variety of engineering structures such as the surface and underground mines, dam foundations, types of tunnels and excavations, and oil wells. In addition, the physical properties of a rock are intrinsic characteristics, which influence its mechanical behavior at a fundamental level. In this paper, a new approach co...

Remote Sensing and Land Use Extraction for Kernel Functions Analysis by Support Vector Machines with ASTER Multispectral Imagery

Land use is considered an element in land change studies, environmental planning, and natural resource applications. Studying the Earth’s surface by remote sensing has many benefits, such as continuous acquisition of data, broad regional coverage, cost-effective data, map-accurate data, and large archives of historical data. To study land use / cover, remote sensing as an effic...


Publication date: 2009